Multivariate Bernoulli Distribution

نویسندگان

  • Bin Dai
  • Shilin Ding
  • Grace Wahba
  • BIN DAI
  • SHILIN DING
  • GRACE WAHBA
چکیده

In this paper, we consider the multivariate Bernoulli distribution as a model to estimate the structure of graphs with binary nodes. This distribution is discussed in the framework of the exponential family, and its statistical properties regarding independence of the nodes are demonstrated. Importantly the model can estimate not only the main effects and pairwise interactions among the nodes but also is capable of modeling higher order interactions, allowing for the existence of complex clique effects. We compare the multivariate Bernoulli model with existing graphical inference models – the Ising model and the multivariate Gaussian model, where only the pairwise interactions are considered. On the other hand, the multivariate Bernoulli distribution has an interesting property in that independence and uncorrelatedness of the component random variables are equivalent. Both the marginal and conditional distributions of a subset of variables in the multivariate Bernoulli distribution still follow the multivariate Bernoulli distribution. Furthermore, the multivariate Bernoulli logistic model is developed under generalized linear model theory by utilizing the canonical link function in order to include covariate information on the nodes, edges and cliques. We also consider variable selection techniques such as LASSO in the logistic model to impose sparsity structure on the graph. Finally, we discuss extending the smoothing spline ANOVA approach to the multivariate Bernoulli logistic model to enable estimation of non-linear effects of the predictor variables.

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

A Control Chart to Monitor a Multivariate Binomial Process

The most applied statistical methods for monitoring multivariate attribute processes have been developed assuming that they have a multinomial distribution, see e.g. Marcucci (1985) and Cassady and Nachlas (2006). However this assumption is not always reasonable; indeed, it is more general and correct to suppose that in each item it is possible to identify one or more of k ordered and not mutua...

متن کامل

Polya’s Urn and the Beta-bernoulli Process

The Polya’s Urn model is notable within statistics because it generalizes the binomial, hypergeometric, and beta-Bernoulli (beta-binomial) distributions through a single formula. In addition, Polya’s Urn is a multivariate distribution whose variables are exchangeable but not independent. This paper introduces basic probability and Bayesian concepts in order to prove these properties.

متن کامل

Bivariate Conway-Maxwell-Poisson distribution: Formulation, properties, and inference

The bivariate Poisson distribution is a popular distribution for modeling bivariate count data. Its basic assumptions and marginal equi-dispersion, however, may prove limiting in some contexts. To allow for data dispersion, we develop here a bivariate Conway–Maxwell–Poisson (COM–Poisson) distribution that includes the bivariate Poisson, bivariate Bernoulli, and bivariate geometric distributions...

متن کامل

Information bounds for Gaussian copulas.

Often of primary interest in the analysis of multivariate data are the copula parameters describing the dependence among the variables, rather than the univariate marginal distributions. Since the ranks of a multivariate dataset are invariant to changes in the univariate marginal distributions, rank-based estimators are natural candidates for semiparametric copula estimation. Asymptotic informa...

متن کامل

Multivariate Bernoulli and Euler polynomials via Lévy processes

By a symbolic method, we introduce multivariate Bernoulli and Euler polynomials as powers of polynomials whose coefficients involve multivariate Lévy processes. Many properties of these polynomials are stated straightforwardly thanks to this representation, which could be easily implemented in any symbolic manipulation system. A very simple relation between these two families of multivariate po...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

برای دانلود متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2012